On the annotation of TMX translation memories for advanced leveraging in computer-aided translation

نویسنده

  • Mikel L. Forcada
چکیده

The term advanced leveraging refers to extensions beyond the current usage of translation memory (TM) in computer-aided translation (CAT). One of these extensions is the ability to identify and use matches on the sub-segment level — for instance, using sub-sentential elements when segments are sentences— to help the translator when a reasonable fuzzy-matched proposal is not available; some suchas functionalities have started to become available in commercial CAT tools. Resources such as statistical word aligners, external machine translation systems, glossaries and term bases could be used to identify and annotate segment-level translation units at the sub-segment level, but there is currently no single, agreed standard supporting the interchange of sub-segmental annotation of translation memories to create a richer translation resource. This paper discusses the capabilities and limitations of some current standards, envisages possible alternatives, and ends with a tentative proposal which slightly abuses (repurposes) the usage of existing elements in the TMX standard.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A CAD System Framework for the Automatic Diagnosis and Annotation of Histological and Bone Marrow Images

Due to ever increasing of medical images data in the world’s medical centers and recent developments in hardware and technology of medical imaging, necessity of medical data software analysis is needed. Equipping medical science with intelligent tools in diagnosis and treatment of illnesses has resulted in reduction of physicians’ errors and physical and financial damages. In this article we pr...

متن کامل

CASMACAT: A Computer-assisted Translation Workbench

CASMACAT is a modular, web-based translation workbench that offers advanced functionalities for computer-aided translation and the scientific study of human translation: automatic interaction with machine translation (MT) engines and translation memories (TM) to obtain raw translations or close TM matches for conventional post-editing; interactive translation prediction based on an MT engine’s ...

متن کامل

On the Relationship between Intralingual/Interlingual Translation and Speaking Fluency of the Iranian Advanced EFL Learners

Speaking as an initial goal in language teaching and learning has relationships with many variables including listening, reading, writing, knowledge of vocabulary as well as grammar. The present study mainly aims at examining the relationship between translations and speaking fluency. For this purpose and following an experimental design, three groups of Iranian advanced EFL learners were...

متن کامل

THE IMPACT OF USING COMPUTER-AIDED ARGUMENT MAPPING (CAAM) ON THE IMPROVEMENT OF IRANIAN EFL LEARNERS’ WRITING SELF-REGULATION

The present study was conducted to investigate the impact of using computer-aided argument mapping (CAAM) on the improvement of Iranian learners’ writing self-regulation. To this end, 90 participants out of 127 senior university students in English translation were selected after administrating language proficiency test, as well as an essay writing test for the purpose of homogenizing the learn...

متن کامل

Language Resources Factory: case study on the acquisition of Translation Memories

This paper demonstrates a novel distributed architecture to facilitate the acquisition of Language Resources. We build a factory that automates the stages involved in the acquisition, production, updating and maintenance of these resources. The factory is designed as a platform where functionalities are deployed as web services, which can be combined in complex acquisition chains using workflow...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014